Datamining protein structure databanks for crystallization patterns of proteins.

Authors

  • Homayoun Valafar
  • James H Prestegard
  • Faramarz Valafar
Abstract

A study of 345 protein structures, selected from among 1,500 structures determined by nuclear magnetic resonance (NMR) methods, revealed useful correlations between crystallization properties and several parameters of the studied proteins. NMR methods of structure determination do not require the growth of protein crystals, and hence allow comparison of the properties of proteins that have or have not been the subject of crystallographic approaches. One- and two-dimensional statistical analyses of the data confirmed a hypothesized relation between the size of the molecule and its crystallization potential. Furthermore, two-dimensional Bayesian analysis revealed a significant relationship between the relative ratio of different secondary structures and the likelihood of success for crystallization trials. The most immediate result is an apparent correlation of crystallization potential with protein size. Further analysis of the data revealed a relationship between the unstructured fraction of a protein and the success of its crystallization. Bayesian analysis of the latter correlation yielded a prediction performance of about 64%, whereas a two-dimensional Bayesian analysis achieved a performance of about 75%.
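The abstract does not specify how the Bayesian analysis was carried out. As a rough illustration only, a two-dimensional Bayesian classifier over protein size and unstructured fraction could be sketched as below. The class name `HistogramBayes`, the bin edges, the add-one smoothing, and all toy values are assumptions made for this sketch, not the authors' method or data, and the toy values imply nothing about the direction of the real correlation.

```python
from collections import Counter


def bin_index(value, edges):
    """Return the index of the first bin whose upper edge exceeds value."""
    for i, edge in enumerate(edges):
        if value < edge:
            return i
    return len(edges)


class HistogramBayes:
    """Two-feature Bayes classifier with binned (histogram) likelihoods.

    Hypothetical sketch: features are (size, unstructured fraction) and the
    label is whether the protein was crystallized.
    """

    def __init__(self, size_edges, frac_edges):
        self.size_edges = size_edges
        self.frac_edges = frac_edges
        self.cell_counts = {True: Counter(), False: Counter()}
        self.class_counts = Counter()

    def _cell(self, size, frac):
        return (bin_index(size, self.size_edges),
                bin_index(frac, self.frac_edges))

    def fit(self, samples):
        # Each sample is (size, unstructured_fraction, crystallized).
        for size, frac, crystallized in samples:
            self.cell_counts[crystallized][self._cell(size, frac)] += 1
            self.class_counts[crystallized] += 1
        return self

    def posterior(self, size, frac):
        """P(crystallized | cell), using add-one smoothed likelihoods."""
        cell = self._cell(size, frac)
        n_cells = (len(self.size_edges) + 1) * (len(self.frac_edges) + 1)
        total = sum(self.class_counts.values())
        joint = {}
        for label in (True, False):
            prior = self.class_counts[label] / total
            likelihood = ((self.cell_counts[label][cell] + 1)
                          / (self.class_counts[label] + n_cells))
            joint[label] = prior * likelihood
        return joint[True] / (joint[True] + joint[False])

    def predict(self, size, frac):
        return self.posterior(size, frac) >= 0.5


# Invented toy data, for illustration only (not taken from the study).
toy = [
    (60, 0.50, False), (70, 0.60, False), (80, 0.45, False),
    (150, 0.20, True), (200, 0.25, True), (180, 0.30, True),
]
model = HistogramBayes(size_edges=[100], frac_edges=[0.4]).fit(toy)
print(model.posterior(190, 0.2))
print(model.predict(65, 0.55))
```

Dropping one of the two features from `_cell` would reduce this to a one-dimensional analysis of the kind the abstract contrasts with the two-dimensional one.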

Similar resources

Sequence-Based Protein Crystallization Propensity Prediction for Structural Genomics: Review and Comparative Analysis

Structural genomics (SG) is an international effort that aims at solving three-dimensional shapes of important biological macro-molecules with primary focus on proteins. One of the main bottlenecks in SG is the ability to produce diffraction quality crystals for X-ray crystallography based protein structure determination. SG pipelines allow for certain flexibility in target selection which moti...


A series of PDB-related databanks for everyday needs

We present a series of databanks (http://swift.cmbi.ru.nl/gv/facilities/) that hold information that is computationally derived from Protein Data Bank (PDB) entries and that might augment macromolecular structure studies. These derived databanks run parallel to the PDB, i.e. they have one entry per PDB entry. Several of the well-established databanks such as HSSP, PDBREPORT and PDB_REDO have be...


Grouping of bread wheat cultivars by seed storage proteins (Sonia Kahrizi, Mohammad Sedghi and Omid Sofalian)

To determine seed storage protein banding patterns in some bread wheat cultivars and the similarity of banding patterns among different cultivars, an experiment based on seed storage protein electrophoresis (albumin and globulin) was performed. Water and salt soluble proteins were extracted in sixteen wheat cultivars using polyacrylamide gel electrophoresis and banding pattern was obtained. Stu...


Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...


Study of the Diversity in Different Cultivars of Pistacia vera L. Resistant to Drought and Salinity: Comparing Protein Patterns Using SDS-PAGE Method

Pistachio is one of the most important agricultural products that have always been associated with Iran, and its production has a long historical background in our country. In this research, protein patterns of 10 cultivars of Pistacia vera L. were compared, in which cultivars grown in normal conditions were compared with cultivars grown in salinity and water shortage to determine diversity. Fo...


Journal:
  • Annals of the New York Academy of Sciences

Volume 980

Publication date: 2002